How to prevent adjoining in TAGs and its impact on the Average Case Complexity

نویسنده

  • Jens Woch
چکیده

The introduction of the adjoining operation to context-free grammars comes at high costs: The worst case time complexity of (Earley, 1968) is O n 3 ¡ , whereas Tree Adjoining Grammars have O n 6 ¡ ((Schabes, 1990)). Thus, avoiding adjoining as far as possible seems to be a good idea for reducing costs (e.g.) address this problem more radically by restricting the adjoining operation of TAGs such that it is no context-sensitive operation anymore. The result is O n 3 ¡ worst case parseability which stems from TIG's context-freeness. However, to preserve TAG's mildly context-sensitiveness the adjoining operation must not be restricted in any way. Another solution would be simply to call the adjoining operation less frequently: The production of items directly depends on the fashion of the underlying grammar and often adjoining is used to make the grammar more comprehensible or more adequate to the linguistic phenomenon even if there would be simpler representations as, for instance, left-or right recursion. This abstract (1st) sketches a way of reducing item generation through grammar transformation by using Schema-TAGs (STAGs , as introduced by (Weir, 1987), where tree sets are enumerated by regular expressions) which in contrast to TIGs keeps weak equivalence and performs better than factoring as proposed by (Harbusch, Widmann and Woch, 1998), and (2nd) provides a proof of the average case time complexity of STAGs based on the proposed transformation. In the following, adressing of nodes occurs in the fashion of (Gorn, 1967), i.e. each node of a tree gets a unique number – beginning with zero – which preserves the structure of the tree. For example, 1¢ 2 points to the second daughter of the first daughter of a root node, and in grammar G 1 of Fig. 2,

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Restrictions on Tree Adjoining Languages

Several methods are known for parsing languages generated by Tree Adjoining Grammars (TAGs) in O(n) worst case running time. In this paper we investigate which restrictions on TAGs and TAG derivations are needed in order to lower this O(n) time complexity, without introducing large runtime constants, and without losing any of the generative power needed to capture the syntactic constructions in...

متن کامل

An Efficient Parsing Algorithm for Tree Adjoining Grammars

In the literature, Tree Adjoining Grammars (TAGs) are propagated to be adequate for natural language description-analysis as well as generation. In this paper we concentrate on the direction of analysis. Especially important for an implementation of that task is how efficiently this can be done, i.e., how readily the word problem can be solved for TAGs. Up to now, a parser with O(n 6) steps in ...

متن کامل

Reflective Practice: How the World Bank Explored Its Own Biases?

While many international organisations have independent evaluations, including the International Monetary Fund (IMF) and World Health organization (WHO), uniquely the World Bank in its 2015 World Development Report sought to ascertain the potential biases that influence how its staff interpret evidence and influence policy. Here, we describe the World Bank’s study design, including experiments ...

متن کامل

Translator Education in the Light of Complexity Theory: A Case of Iran’s Higher Education System

In the fast-growing world of translation studies, many students may not receive adequate training at universities. A new multi-facetted approach is therefore needed to be applied in translator educational programs to meet the students’ needs and professional expectations. In order to describe the complex interrelations in translator education systems and propose a research framework that takes ...

متن کامل

بررسی مقایسه‌ای تأثیر برچسب‌زنی مقولات دستوری بر تجزیه در پردازش خودکار زبان فارسی

In this paper, the role of Part-of-Speech (POS) tagging for parsing in automatic processing of the Persian language is studied. To this end, the impact of the quality of POS tagging as well as the impact of the quantity of information available in the POS tags on parsing are studied. To reach the goals, three parsing scenarios are proposed and compared. In the first scenario, the parser assigns...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002